Sub-banded reconstructed phase spaces for speech recognition
نویسندگان
چکیده
A novel method combining filter banks and reconstructed phase spaces is proposed for the modeling and classification of speech. Reconstructed phase spaces, which are based on dynamical systems theory, have advantages over spectral-based analysis methods in that they can capture nonlinear or higher-order statistics. Recent work has shown that the natural measure of a reconstructed phase space can be used for modeling and classification of phonemes. In this work, sub-banding of speech, which has been examined for recognition of noise-corrupted speech, is studied in combination with phase space reconstruction. This sub-banding, which is motivated by empirical psychoacoustical studies, is shown to dramatically improve the phoneme classification accuracy of reconstructed phase space-based approaches. Experiments that examine the performance of fused sub-banded reconstructed phase spaces for phoneme classification are presented. Comparisons against a cepstral-based classifier show that the proposed approach is competitive with state-of-the-art methods for modeling and classification of phonemes. Combination of cepstral-based features and the sub-band RPS features shows improvement over a cepstral-only baseline.
منابع مشابه
Classification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
متن کاملStatistical Models of Reconstructed Phase Spaces for Signal Classif
– This paper introduces a novel approach to the analysis and classification of time series signals using statistical models of reconstructed phase spaces. With sufficient dimension, such reconstructed phase spaces are, with probability one, guaranteed to be topologically equivalent to the state dynamics of the generating system, and therefore may contain information that is absent in analysis a...
متن کاملPhoneme Classification Using Naive Bayes Classifier in Reconstructed Phase Space
A novel method for classifying speech phonemes is presented. Unlike traditional cepstral based methods, this approach uses histograms of reconstructed phase spaces. A Naïve Bayes classifier uses the probability mass estimates for classification. The approach is verified using isolated fricative, vowel, and nasal phonemes from the TIMIT corpus. The results show that a reconstructed phase space a...
متن کاملPage 1 A Combined Sub-band and Reconstructed Phase Space Approach to Phoneme Classification
This paper presents a method of classifying phonemes by combining a dynamical system approach with sub-band decomposition of speech signals. The ability of reconstructed phase spaces to effectively model sub-bands of phonemes in different phonological classes is studied. The current results are taken from a small speaker-independent set. For the final version of this paper, the entire TIMIT dat...
متن کاملA combined sub-band and reconstructed phase space approach to phoneme classification
In this paper a method of classifying phonemes by combining a dynamical systems approach with subband decomposition of speech signals is presented. The ability of reconstructed phase spaces to effectively model sub-bands of phonemes in different phonological classes is demonstrated. Experiments performed over the TIMIT database show how well phonemes from different phonological classes can be r...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Speech Communication
دوره 48 شماره
صفحات -
تاریخ انتشار 2006